International Journal of Data Science and Big Data Analytics
|
Volume 1, Issue 2, May 2021 | |
Research PaperOpenAccess | |
Federated learning for privacy-preserving data access |
|
Malgorzata Smietanka1*, Hirsh Pithadia2 and Philip Treleaven3 |
|
1University College London, Gower St, Bloomsbury, London WC1E 6BT, United Kingdom. E-mail: malgorzata.wasiewicz.17@ucl.ac.uk
*Corresponding Author | |
Int.J.Data.Sci. and Big Data Anal. 1(2) (2021) 1-13, DOI: https://doi.org/10.51483/IJDSBDA.1.2.2021.1-13 | |
Received: 20/12/2020|Accepted: 13/04/2021|Published: 05/05/2021 |
Federated learning is a pioneering privacy-preserving data technology and also a new machine learning model trained on distributed data sets. Companies collect huge amounts of historic and real-time data to drive their business and collaborate with other organizations. However, data privacy is becoming increasingly important because of regulations (e.g., EU GDPR) and the need to protect their sensitive and personal data. Companies need to manage data access: firstly within their organizations (so they can control staff access), and secondly protecting raw data when collaborating with third parties. What is more, companies are increasingly looking to ‘monetize’ the data they’ve collected. However, under new legislations, utilizing data by different organization is becoming increasingly difficult (Yu,2016). Federated learning pioneered by Google is the emerging privacy- preserving data technology and also a new class of distributed machine learning models. This paper discusses federated learning as a solution for privacy-preserving data access and distributed machine learning applied to distributed data sets. It also presents a privacy-preserving federated learning infrastructure.
Keywords: Federated Learning, Privacy, Machine Learning, Data Science, InsurTech
Full text | Download |
Copyright © SvedbergOpen. All rights reserved